The Users Who Say 'Ni': Audience Identification in Chinese-language Restaurant Reviews
Abstract
We give an algorithm for disambiguating generic versus referential uses of second-person pronouns in Chinese-language restaurant reviews. Reviews in this domain use the ‘you’ pronoun 你 either generically or referentially, to address shopkeepers or readers, or for self-reference in reported conversation. We first show that linguistic features of the local context (drawn from prior literature) help in disambiguation. We then show that document-level features (n-grams and document-level embeddings), which have not previously been used in the referentiality literature, give the largest gain in performance, and we suggest this is because pronouns in this domain exhibit ‘one-sense-per-discourse’ behavior. Our work highlights an important case of discourse effects on pronoun use, and may have practical implications for audience extraction and other sentiment tasks in online reviews.
Similar resources
Culture, Imagined Audience, and Language Choices of Multilingual Chinese and Korean Students on Facebook
Multilingual users of social networking sites (SNSs) write in different languages for various reasons. In this paper, we explore the language choice of multilingual Chinese and Korean students studying in the United States on Facebook. We survey the effects of collectivist culture, imagined audience, and language proficiency on their language choice. Results show that multilingual users use lan...
Predicting user rating for Yelp businesses leveraging user similarity
Users visit a Yelp business, such as a restaurant, based on its overall rating and often based on other factors like location, hours of location, type/cuisine or other attributes such as free Wifi. In addition to this, users gain useful insight for a Yelp business based on its top reviews and highlights. However, the average rating that a business has, or the top reviews/feedback as per certain...
Incorporating Language Patterns and Domain Knowledge into Feature-Opinion Extraction
We present a hybrid method for aspect-based sentiment analysis of Chinese restaurant reviews. The proposed method employs two main components to extract feature-opinion pairs: domain-independent language patterns found in Chinese and a lexical base built for restaurant reviews. The language patterns focus on the general knowledge which is implicitly contained in Chinese, thus can be...
Godless words in a vassal’s language named Hallaj An Overview to “Shath” and their types in Hallaj’s poems
“Godless words”, “inspired paradox” and, as mystics phrase it, “Shath” is a language creation of a mystic’s enchanted soul’s excitement and necessity. This mystic is a person who is confronted with an illogical subject on one hand and on the other hand he has to use logical language which is a deficient tool for describing the illogical subject. Today’s fair audience – Regardless of Controversy...
Beyond the Stars: Improving Rating Predictions using Review Text Content
Online reviews are an important asset for users deciding to buy a product, see a movie, or go to a restaurant, as well as for businesses tracking user feedback. However, most reviews are written in a free-text format, and are therefore difficult for computer systems to understand, analyze, and aggregate. One consequence of this lack of structure is that searching text reviews is often frustrati...